Mindmap: Utilizing Multiple Taxonomies and Visualization to Understand a Document Collection

Authors

  • W. Scott Spangler
  • Jeffrey T. Kreulen
  • Justin Lessler
Abstract

We present a novel system and methodology for browsing and exploring topics and concepts within a document collection. The process begins with the generation of multiple taxonomies from the document collections, each having a unique theme. These taxonomies then become an integral tool in the exploration of the document collection. It is assumed that the user of our system may have only a vague notion of exactly what they are attempting to understand, and would like to explore related topics and concepts rather than simply being given a set of documents. For this purpose, we have developed the MindMap interface to the document collection. Starting from an initial keyword query, the MindMap interface helps the user to explore the concept space by first presenting the user with related terms and high level topics in a radial graph. After refining the query by selecting any related terms, one of the related high level concepts can be selected for further investigation. The MindMap uses a novel binary tree interface to explore the composition of a concept based on the presence or absence of terms. From the binary tree a concept can be further explored and visualized. Individual documents are presented as spatial coordinates where distance between points relates to document similarity. As the user browses this spatial representation, text is presented from the document that is most relevant to the user’s initial query. Individual points can be selected to pull up the relevant paragraphs from the document with the keywords highlighted. Finally, selected documents are displayed and the user is allowed to further interact and investigate.
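
As a rough illustration of the binary tree step mentioned in the abstract, the sketch below splits a set of documents by the presence or absence of terms. It is not the authors' implementation: the names `TermNode` and `build_term_tree`, the fixed order in which terms are tried, and the toy documents are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Set


@dataclass
class TermNode:
    """One node of a hypothetical presence/absence tree over documents."""
    docs: List[int]                       # indices of the documents in this node
    term: Optional[str] = None            # splitting term; None marks a leaf
    present: Optional["TermNode"] = None  # documents that contain `term`
    absent: Optional["TermNode"] = None   # documents that do not contain `term`


def build_term_tree(doc_terms: Sequence[Set[str]],
                    doc_ids: List[int],
                    terms: List[str]) -> TermNode:
    """Recursively split `doc_ids` on each term in `terms`, in the given order.

    Assumption: the term order is supplied by the caller; the abstract does
    not say how the original system chooses or ranks terms.
    """
    if not terms or len(doc_ids) < 2:
        return TermNode(docs=list(doc_ids))
    term, rest = terms[0], terms[1:]
    with_term = [i for i in doc_ids if term in doc_terms[i]]
    without_term = [i for i in doc_ids if term not in doc_terms[i]]
    if not with_term or not without_term:
        # The term does not separate these documents, so try the next one.
        return build_term_tree(doc_terms, doc_ids, rest)
    return TermNode(
        docs=list(doc_ids),
        term=term,
        present=build_term_tree(doc_terms, with_term, rest),
        absent=build_term_tree(doc_terms, without_term, rest),
    )


# Toy collection: each document is reduced to its set of terms.
docs = [{"engine", "oil"}, {"engine", "brake"}, {"brake"}, {"oil"}]
tree = build_term_tree(docs, [0, 1, 2, 3], ["engine", "oil"])
```

Here the root splits on "engine", and each branch is then split on "oil", so every leaf corresponds to one combination of present and absent terms.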

Similar resources

Recognizing Emergent Nodes in Aligning Multiple Document Taxonomies

A document taxonomy alignment method, relying on document glosses and utilizing a soft ontology expansion, enables us to devise some all-new hierarchical leaf nodes for the purpose of better aligning a plurality of document taxonomies.

Using Visualization for Information Management Tasks

Taxonomies are a powerful modelling tool when building interfaces for disclosing large information repositories. However, their actual use is far from trivial; tasks such as creation, instantiation and maintenance of taxonomies are often difficult and time-consuming. We present a number of ways in which the Cluster Map, a component for the visualization of instantiated taxonomies, can help in t...

Text to Multi-level MindMaps: A Novel Method for Hierarchical Visual Abstraction of Natural Language Text

MindMapping [45] is a well-known technique used in note taking, and is known to encourage learning and studying. MindMapping has been manually adopted to help present knowledge and concepts in a visual form. Unfortunately, there is no reliable automated approach that can generate MindMaps from Natural Language text. This work firstly introduces MindMap Multilevel Visualization concept which is ...

Constructing Task-Specific Taxonomies for Document Collection Browsing

Taxonomies can serve as browsing tools for document collections. However, given an arbitrary collection, pre-constructed taxonomies could not easily adapt to the specific topic/task present in the collection. This paper explores techniques to quickly derive task-specific taxonomies supporting browsing in arbitrary document collections. The supervised approach directly learns semantic distances ...

Resolving Task Specification and Path Inconsistency in Taxonomy Construction

Taxonomies, such as Library of Congress Subject Headings and Open Directory Project, are widely used to support browsing-style information access in document collections. We call them browsing taxonomies. Most existing browsing taxonomies are manually constructed thus they could not easily adapt to arbitrary document collections. In this paper, we investigate both automatic and interactive tech...

Publication date: 2002